AAAI.2021 - Demonstration Track

Total: 46

#1 A Semantic Parsing and Reasoning-Based Approach to Knowledge Base Question Answering [PDF] [Copy] [Kimi]

Authors: Ibrahim Abdelaziz ; Srinivas Ravishankar ; Pavan Kapanipathi ; Salim Roukos ; Alexander Gray

Knowledge Base Question Answering (KBQA) is a task where existing techniques have faced significant challenges, such as the need for complex question understanding, reasoning, and large training datasets. In this work, we demonstrate Deep Thinking Question Answering (DTQA), a semantic parsing and reasoning-based KBQA system. DTQA (1) integrates multiple, reusable modules that are trained specifically for their individual tasks (e.g. semantic parsing, entity linking, and relationship linking), eliminating the need for end-to-end KBQA training data; (2) leverages semantic parsing and a reasoner for improved question understanding. DTQA is a system of systems that achieves state-of-the-art performance on two popular KBQA datasets.

#2 Exploring the Efficacy of Generic Drugs in Treating Cancer [PDF] [Copy] [Kimi]

Authors: Ioana Baldini ; Mariana Bernagozzi ; Sulbha Aggarwal ; Mihaela Bornea ; Saksham Chawla ; Joppe Geluykens ; Dmitriy A. Katz-Rogozhnikov ; Pratik Mukherjee ; Smruthi Ramesh ; Sara Rosenthal ; Jagrati Sharma ; Kush R. Varshney ; Laura B. Kleiman ; Pradeep Mangalath ; Catherine Del Vecchio Fitz

Thousands of scientific publications discuss evidence on the efficacy of non-cancer generic drugs being tested for cancer. However, trying to manually identify and extract such evidence is intractable at scale. We introduce a natural language processing pipeline to automate the identification of relevant studies and facilitate the extraction of therapeutic associations between generic drugs and cancers from PubMed abstracts. We annotate datasets of drug-cancer evidence and use them to train models to identify and characterize such evidence at scale. To make this evidence readily consumable, we incorporate the results of the models in a web application that allows users to browse documents and their extracted evidence. Users can provide feedback on the quality of the evidence extracted by our models. This feedback is used to improve our datasets and the corresponding models in a continuous integration system. We describe the natural language processing pipeline in our application and the steps required to deploy services based on the machine learning models.

#3 OzoMorph: Demonstrating Colored Multi-Agent Path Finding on Real Robots [PDF] [Copy] [Kimi]

Authors: Roman Barták ; Jakub Mestek

Multi-agent Path Finding (MAPF) deals with finding collision-free paths for a set of agents on a graph, where each agent has its origin and destination. Colored MAPF is a generalization of MAPF, where groups of agents are moving, and the set of destination nodes is specified per group rather than per agent. OzoMorph is software providing an intuitive user interface for specifying Colored MAPF problems, solving them by translation to SAT, and finally visualizing the solution either in a computer simulation or by converting the plans to executable instructions for Ozobot Evo robots.

#4 VEGA: a Virtual Environment for Exploring Gender Bias vs. Accuracy Trade-offs in AI Translation Services [PDF] [Copy] [Kimi]

Authors: Mariana Bernagozzi ; Biplav Srivastava ; Francesca Rossi ; Sheema Usmani

Machine translation services are a very popular class of Artificial Intelligence (AI) services nowadays but public's trust in these services is not guaranteed since they have been shown to have issues like bias. In this work, we focus on the behavior of machine translators with respect to gender bias as well as their accuracy. We have created the first-of-its-kind virtual environment, called VEGA, where the user can interactively explore translation services and compare their trust ratings using different visuals.

#5 A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices [PDF] [Copy] [Kimi]

Authors: Yuxuan Cai ; Geng Yuan ; Hongjia Li ; Wei Niu ; Yanyu Li ; Xulong Tang ; Bin Ren ; Yanzhi Wang

The rapid development and wide utilization of object detection techniques have aroused requirements for both accuracy and speed of object detectors. In this work, we propose a compression-compilation co-design framework to achieve real-time YOLOv4 inference on mobile devices. We propose a novel fine-grained structured pruning, which maintain high accuracy while achieving high hardware parallelism. Our pruned YOLOv4 achieves 48.9 mAP and 17 FPS inference speed on an off-the-shelf Samsung Galaxy S20 smartphone, which is 5.5x faster than the original state-of-the-art detector YOLOv4.

#6 AutoText: An End-to-End AutoAI Framework for Text [PDF] [Copy] [Kimi]

Authors: Arunima Chaudhary ; Alayt Issak ; Kiran Kate ; Yannis Katsis ; Abel Valente ; Dakuo Wang ; Alexandre Evfimievski ; Sairam Gurajada ; Ban Kawas ; Cristiano Malossi ; Lucian Popa ; Tejaswini Pedapati ; Horst Samulowitz ; Martin Wistuba ; Yunyao Li

Building models for natural language processing (NLP) tasks remains a daunting task for many, requiring significant technical expertise, efforts, and resources. In this demonstration, we present AutoText, an end-to-end AutoAI framework for text, to lower the barrier of entry in building NLP models. AutoText combines state-of-the-art AutoAI optimization techniques and learning algorithms for NLP tasks into a single extensible framework. Through its simple, yet powerful UI, non-AI experts (e.g., domain experts) can quickly generate performant NLP models with support to both control (e.g., via specifying constraints) and understand learned models.

#7 A Health-friendly Speaker Verification System Supporting Mask Wearing [PDF] [Copy] [Kimi]

Authors: Chaotao Chen ; Di Jiang ; Jinhua Peng ; Rongzhong Lian ; Chen Jason Zhang ; Qian Xu ; Lixin Fan ; Qiang Yang

We demonstrate a health-friendly speaker verification system for voice-based identity verification on mobile devices. The system is built upon a speech processing module, a ResNet-based local acoustic feature extractor and a multi-head attention-based embedding layer, and is optimized under an additive margin softmax loss for discriminative speaker verification. It is shown that the system achieves superior performance no matter whether there is mask wearing or not. This characteristic is important for speaker verification services operating in regions affected by the raging coronavirus pneumonia. With this demonstration, the audience will have an in-depth experience of how the accuracy of bio-metric verification and the personal health are simultaneously ensured. We wish that this demonstration would boost the development of next-generation bio-metric verification technologies.

#8 An Intelligent Assistant for Problem Behavior Management [PDF] [Copy] [Kimi]

Authors: Penghe Chen ; Yu Lu ; Jiefei Liu ; Qi Xu

We design and implement an intelligent assistant, called PB-Advisor, to advise teachers and parents on students' problem behaviors. It utilizes a task-oriented dialogue system to identify the need deficiency underlying students' problem behaviors, and relies on a community question answering system to provide advice on typical problem behavior management. In addition, it also provides various learning resources, and illustrates the relations between influential factors on typical problem behaviors through data analysis. With PB-Advisor, teachers and parents without psychological expertise can easily find proper advice on students’ problem behaviors.

#9 OPRA: An Open-Source Online Preference Reporting and Aggregation System [PDF] [Copy] [Kimi]

Authors: Yiwei Chen ; Jingwen Qian ; Junming Wang ; Lirong Xia ; Gavriel Zahavi

We introduce the Online Preference Reporting and Aggregation (OPRA) system, an open-source online system that aims at providing support for group decision-making. We illustrate OPRA's distinctive features: UI for reporting rankings with ties, comprehensive analytics of preferences, and group decision-making in combinatorial domains. We also discuss our work in an automatic mentor matching system. We hope that the open-source nature of OPRA will foster development of computerized group decision support systems.

#10 ESO-MAPF: Bridging Discrete Planning and Continuous Execution in Multi-Agent Pathfinding [PDF] [Copy] [Kimi]

Authors: Ján Chudý ; Pavel Surynek

We present ESO-MAPF, a research and educational platform for experimenting with multi-agent path finding (MAPF). ESO-MAPF focuses on demonstrating the planning-acting chain in the MAPF domain. MAPF is the task of finding collision free paths for agents from their starting positions to given individual goals. The standard MAPF uses the abstraction where agents move in an undirected graph via traversing its edges in discrete steps. The discrete abstraction simplifies the planning phase however resulting discrete plans often need to be executed in the real continuous environment. ESO-MAPF shows how to bridge discrete planning and the acting phase in which the resulting plans are executed on physical robots. We simulate centralized plans on a group of OZOBOT Evo robots using their reflex functionalities and outputs on the surface of the screen that serves as the environment. Various problems arising along the planning-acting chain are illustrated to emphasize the educational point of view.

#11 Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback [PDF] [Copy] [Kimi]

Authors: Yuchen Cui ; Qiping Zhang ; Sahil Jain ; Alessandro Allievi ; Peter Stone ; Scott Niekum ; W. Bradley Knox

Reactions such as gestures, facial expressions, and vocalizations are an abundant, naturally occurring channel of information that humans provide during interactions. An agent could leverage an understanding of such implicit human feedback to improve its task performance at no cost to the human. This approach contrasts with common agent teaching methods based on demonstrations, critiques, or other guidance that need to be attentively and intentionally provided. In this work, we demonstrate a novel data-driven framework for learning from implicit human feedback, EMPATHIC. This two-stage method consists of (1) mapping implicit human feedback to relevant task statistics such as reward, optimality, and advantage; and (2) using such a mapping to learn a task. We instantiate the first stage and three second-stage evaluations of the learned mapping. To do so, we collect a dataset of human facial reactions while participants observe an agent execute a sub-optimal policy for a prescribed training task. We train a deep neural network on this data and demonstrate its ability to (1) infer relative reward ranking of events in the training task from prerecorded human facial reactions; (2) improve the policy of an agent in the training task using live human facial reactions; and (3) transfer to a novel domain in which it evaluates robot manipulation trajectories. In the video, we focus on demonstrating the online learning capability of our instantiation of EMPATHIC.

#12 Juice: A Julia Package for Logic and Probabilistic Circuits [PDF] [Copy] [Kimi]

Authors: Meihua Dang ; Pasha Khosravi ; Yitao Liang ; Antonio Vergari ; Guy Van den Broeck

Juice is an open-source Julia package providing tools for logic and probabilistic reasoning and learning based on logic circuits (LCs) and probabilistic circuits (PCs). It provides a range of efficient algorithms for probabilistic inference queries, such as computing marginal probabilities (MAR), as well as many more advanced queries. Certain structural circuit properties are needed to achieve this tractability, which Juice helps validate. Additionally, it supports several parameter and structure learning algorithms proposed in the recent literature. By leveraging parallelism (on both CPU and GPU), Juice provides a fast implementation of circuit-based algorithms, which makes it suitable for tackling large-scale datasets and models.

#13 Bootstrapping Dialog Models from Human to Human Conversation Logs [PDF] [Copy] [Kimi]

Authors: Pankaj Dhoolia ; Vineet Kumar ; Danish Contractor ; Sachindra Joshi

State-of-the-art commercial dialog platforms provide powerful tools to build a conversational agent. These platforms provide complete control to the dialog designer to model user-agent interactions. However, a dialog designer needs to rely on domain experts to manually build the dialog model -- by creating dialog flow nodes and modeling user intents. This process is laborious, time consuming and expensive and does not allow the designer to exploit human to human conversation logs effectively. In this work, we present a research prototype that can ingest human-to-human conversation logs between an end-user and an agent, and suggest user-intents and agent-responses, given a conversation context. We utilize human to human conversation logs to build two emulators: user and agent. An agent emulator models an agent response given the conversation context so far, and a user emulator outputs possible user responses. Our system is able to recommend conversational intents as well as conversation flow using emulators based on real-world data, thus making the process of designing a bot more efficient. To the best our knowledge this is the first system that enables data-driven dialog model creation by emulating users and agents.

#14 Doc2Bot: Document grounded Bot Framework [PDF] [Copy] [Kimi]

Authors: Kshitij Fadnis ; Pankaj Dhoolia ; Li Zhu ; Q. Vera Liao ; Steven Ross ; Nathaniel Mills ; Sachindra Joshi ; Luis Lastras

Conversational agents, or chatbots, are widely used to provide customer care and other informational support. Currently, the development of chatbots using standard frameworks requires a lot of manual crafting by subject matter experts (SMEs). On the other hand, while learning-based approaches to dialog have made significant advancements, they require training with a large volume of dialog data, which chatbot developers typically do not have access to. To tackle these challenges, we introduce DOC2BOT, a system that supports the automated construction of chatbots by digesting various forms of documents such as business manuals, HowTos, and customer support pages that organizations own. In addition to that, DOC2BOT provides a user-friendly experience to SMEs, and to minimize their effort by supporting intuitive interactions and streamlining their workflow.

#15 KAAPA: Knowledge Aware Answers from PDF Analysis [PDF] [Copy] [Kimi]

Authors: Nicolas Fauceglia ; Mustafa Canim ; Alfio Gliozzo ; Jennifer J Liang ; Nancy Xin Ru Wang ; Douglas Burdick ; Nandana Mihindukulasooriya ; Vittorio Castelli ; Guy Feigenblat ; David Konopnicki ; Yannis Katsis ; Radu Florian ; Yunyao Li ; Salim Roukos ; Avirup Sil

We present KaaPa (Knowledge Aware Answers from Pdf Analysis), an integrated solution for machine reading comprehension over both text and tables extracted from PDFs. KaaPa enables interactive question refinement using facets generated from an automatically induced Knowledge Graph. In addition it provides a concise summary of the supporting evidence for the provided answers by aggregating information across multiple sources. KaaPa can be applied consistently to any collection of documents in English with zero domain adaptation effort. We showcase the use of KaaPa for QA on scientific literature using the COVID-19 Open Research Dataset.

#16 IBM Scenario Planning Advisor: A Neuro-Symbolic ERM Solution [PDF] [Copy] [Kimi]

Authors: Mark Feblowitz ; Oktie Hassanzadeh ; Michael Katz ; Shirin Sohrabi ; Kavitha Srinivas ; Octavian Udrea

Scenario Planning is a commonly used Enterprise Risk Management (ERM) technique to help decision makers with longterm plans by considering multiple alternative futures. It is typically a manual, highly labor intensive process involving dozens of experts and hundreds to thousands of person-hours. We previously introduced a Scenario Planning Advisor prototype (Sohrabi et al. 2018a,b) that focuses on generating scenarios quickly based on expert-developed models. We present the evolution of that prototype into a full-scale, cloud deployed ERM solution that: (i) can automatically (through NLP) create models from authoritative documents such as books, reports and articles, such that what typically took hundreds to thousands of person-hours can now be achieved in minutes to hours; (ii) can gather news and other feeds relevant to forces in the risk models and group them into storylines without any other user input; (iii) can generate scenarios at scale, starting with dozens of forces of interest from models with thousands of forces in seconds; (iv) provides interactive visualizations of scenario and force model graphs, including a full model editor in the browser. The SPA solution is deployed under a non-commercial use license at https://spa-service.draco.res.ibm.com and includes a user guide to help new users get started. A video demonstration is available at https://www.youtube.com/watch?v=IaX3d37NUl8.

#17 NEO: A System for Identifying New Emerging Occupation from Job Ads [PDF] [Copy] [Kimi]

Authors: Anna Giabelli ; Lorenzo Malandri ; Fabio Mercorio ; Mario Mezzanzanica ; Andrea Seveso

We demonstrate NEO, a tool for automatically enriching the European Occupation and Skill Taxonomy (ESCO) with terms that represents new occupations extracted from million Online Job Advertisements (OJAs). NEO proposes (i) a novel metric that allows one to measure the semantic similarity between words in a taxonomy, and (ii) a set of measures that estimate the adherence of new terms to the most suited taxonomic concept, enabling the user to evaluate the suggestions. To test its effectiveness, NEO has been evaluated over 2M+ 2018 UK job ads, along with a user-study to confirm the usefulness of NEO in the taxonomy enrichment task.

#18 Dialog Router: Automated Dialog Transition via Multi-Task Learning [PDF] [Copy] [Kimi]

Authors: Ziming Huang ; Zhuoxuan Jiang ; Hao Chen ; Xue Han ; Yabin Dang

Dialog Router is a general paradigm for human-bot symbiosis dialog systems to provide friendly customer care service. It is equipped with a multi-task learning model to automatically capture the underlying correlation between multiple related tasks, i.e. dialog classification and regression, and greatly reduce human labor work for system customization, which improves the accuracy of dialog transition. In addition, for learning the multi-task model, the training data and labels are easy to collect from human-to-human historical dialog logs, and the Dialog Router can be easily integrated into the majority of existing dialog systems by calling general APIs. We conduct experiments on real-world datasets for dialog classification and regression. The results show that our model achieves improvements on both tasks, which benefits the dialog transition application. The demo illustrates our method’s effectiveness in a real customer care service.

#19 EasyRL: A Simple and Extensible Reinforcement Learning Framework [PDF] [Copy] [Kimi]

Authors: Neil Hulbert ; Sam Spillers ; Brandon Francis ; James Haines-Temons ; Ken Gil Romero ; Benjamin De Jager ; Sam Wong ; Kevin Flora ; Bowei Huang ; Athirai A. Irissappane

In recent years, Reinforcement Learning (RL), has become a popular field of study as well as a tool for enterprises working on cutting-edge artificial intelligence research. To this end, many researchers have built RL frameworks such as openAI Gym, and KerasRL for ease of use. While these works have made great strides towards bringing down the barrier of entry for those new to RL, we propose a much simpler framework called EasyRL, by providing an interactive graphical user interface for users to train and evaluate RL agents. As it is entirely graphical, EasyRL does not require programming knowledge for training and testing simple built-in RL agents. EasyRL also supports custom RL agents and environments, which can be highly beneficial for RL researchers in evaluating and comparing their RL models.

#20 AI-Empowered Decision Support for COVID-19 Social Distancing [PDF] [Copy] [Kimi]

Authors: Hongchao Jiang ; Wei Yang Bryan Lim ; Jer Shyuan Ng ; Harold Ze Chie Teng ; Han Yu ; Zehui Xiong ; Dusit Niyato ; Chunyan Miao

The COVID-19 pandemic is one of the most severe challenges the world faces today. In order to contain the transmission of COVID-19, people around the world have been advised to practise social distancing. However, maintaining social distance is a challenging problem, as we often do not know beforehand how crowded the places we intend to visit are. In this paper, we demonstrate crowded.sg, an AI-empowered platform that leverages on Unmanned Aerial Vehicles (UAVs), crowdsourced images, and computer vision techniques to provide social distancing decision support.

#21 Mobile-based Clock Drawing Test for Detecting Early Signs of Dementia [PDF] [Copy] [Kimi]

Authors: Hongchao Jiang ; Yanci Zhang ; Zhiwei Zeng ; Jun Ji ; Yu Wang ; Ying Chi ; Chunyan Miao

Dementia is one of the major causes of disability and dependency among older people. Early detection is the key for preserving the quality of life of the patients and reducing caring costs. The Clock Drawing Test (CDT) is commonly used by clinicians to screen for early signs of dementia. We build an automated CDT that runs on mobile platforms, enabling convenient and frequent self-monitoring and testing at minimal costs. Our system combines both a spatial-temporal approach and a purely image-based deep learning approach to analyze and evaluate the hand-drawn clocks based on established clinical criteria. Our system produces scores that are highly correlated with expert human raters.

#22 RADAR-X: An Interactive Interface Pairing Contrastive Explanations with Revised Plan Suggestions [PDF] [Copy] [Kimi]

Authors: Valmeekam Karthik ; Sarath Sreedharan ; Sailik Sengupta ; Subbarao Kambhampati

Automated Planning techniques can be leveraged to build effective decision support systems that assist the human-in-the-loop. Such systems must provide intuitive explanations when the suggestions made by these systems seem inexplicable to the human. In this regard, we consider scenarios where the user questions the system's suggestion by providing alternatives (referred to as foils). In response, we empower existing decision support technologies to engage in an interactive explanatory dialogue with the user and provide contrastive explanations based on user-specified foils to reach a consensus on proposed decisions. To provide contrastive explanations, we adapt existing techniques in Explainable AI Planning (XAIP). Furthermore, we use this dialog to elicit the user's latent preferences and propose three modes of interaction that use these preferences to provide revised plan suggestions. Finally, we showcase a decision support system that provides all these capabilities.

#23 Business Entity Matching with Siamese Graph Convolutional Networks [PDF] [Copy] [Kimi]

Authors: Evgeny Krivosheev ; Mattia Atzeni ; Katsiaryna Mirylenka ; Paolo Scotton ; Christoph Miksovic ; Anton Zorin

Data integration has been studied extensively for decades and approached from different angles. However, this domain still remains largely rule-driven and lacks universal automation. Recent developments in machine learning and in particular deep learning have opened the way to more general and efficient solutions to data-integration tasks. In this paper, we demonstrate an approach that allows modeling and integrating entities by leveraging their relations and contextual information. This is achieved by combining siamese and graph neural networks to effectively propagate information between connected entities and support high scalability. We evaluated our approach on the task of integrating data about business entities, demonstrating that it outperforms both traditional rule-based systems and other deep learning approaches.

#24 Democratizing Constraint Satisfaction Problems through Machine Learning [PDF] [Copy] [Kimi]

Authors: Mohit Kumar ; Samuel Kolb ; Clement Gautrais ; Luc De Raedt

Constraint satisfaction problems (CSPs) are used widely, especially in the field of operations research, to model various real world problems like scheduling or planning. However,modelling a problem as a CSP is not trivial, it is labour intensive and requires both modelling and domain expertise. The emerging field of constraint learning deals with this problem by automatically learning constraints from a given dataset. While there are several interesting approaches for constraint learning, these works are hard to access for a non-expert user. Furthermore, different approaches have different underlying formalism and require different setups before they can be used. This demo paper combines these researches and brings it to non-expert users in the form of an interactive Excel plugin. To do this, we translate different formalism for specifying CSPs into a common language, which allows multiple constraint learners to coexist, making this plugin more powerful than individual constraint learners. Moreover, we integrate learning of CSPs from data with solving them, making it a self sufficient plugin. For the developers of different constraint learners, we provide an API that can be used to integrate their work with this plugin by implementing a handful of functions.

#25 TODS: An Automated Time Series Outlier Detection System [PDF] [Copy] [Kimi]

Authors: Kwei-Herng Lai ; Daochen Zha ; Guanchu Wang ; Junjie Xu ; Yue Zhao ; Devesh Kumar ; Yile Chen ; Purav Zumkhawaka ; Minyang Wan ; Diego Martinez ; Xia Hu

We present TODS, an automated Time Series Outlier Detection System for research and industrial applications. TODS is a highly modular system that supports easy pipeline construction. The basic building block of TODS is primitive, which is an implementation of a function with hyperparameters. TODS currently supports 70 primitives, including data processing, time series processing, feature analysis, detection algorithms, and a reinforcement module. Users can freely construct a pipeline using these primitives and perform end- to-end outlier detection with the constructed pipeline. TODS provides a Graphical User Interface (GUI), where users can flexibly design a pipeline with drag-and-drop. Moreover, a data-driven searcher is provided to automatically discover the most suitable pipelines given a dataset. TODS is released under Apache 2.0 license at https://github.com/datamllab/tods. A video is available on YouTube (https://youtu.be/JOtYxTclZgQ)